Anonymizing Binary Tables is APX-hard

نویسندگان

  • Paola Bonizzoni
  • Gianluca Della Vedova
  • Riccardo Dondi
چکیده

The problem of publishing personal data without giving up privacy is increasingly important. An interesting formalization is the k-anonymization, where all rows in a table are clustered in sets of at least k records, and all the entries for which records in the same cluster have different values are suppressed. The problem has been shown to be NP-hard when the records values are over a ternary alphabet and k = 3. In this paper we show that the problem is not only NP-hard, but also APX-hard, when the records values are over a binary alphabet and k = 3.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Non-approximability of weighted multiple sequence alignment for arbitrary metrics

We prove that the multiple sequence alignment problem with weighted sum-of-pairs score is APX-hard for arbitrary metric scoring functions over the binary alphabet. This holds even when the weights are restricted to zero and one.

متن کامل

On the Complexity of the Single Individual SNP Haplotyping Problem

We present several new results pertaining to haplotyping. These results concern the combinatorial problem of reconstructing haplotypes from incomplete and/or imperfectly sequenced haplotype fragments. We consider the complexity of the problems Minimum Error Correction (MEC) and Longest Haplotype Reconstruction (LHR) for different restrictions on the input data. Specifically, we look at the gapl...

متن کامل

Anonymizing Unstructured Data

In this paper we consider the problem of anonymizing datasets in which each individual is associated with a set of items that constitute private information about the individual. Illustrative datasets include market-basket datasets and search engine query logs. We formalize the notion of k-anonymity for set-valued data as a variant of the k-anonymity model for traditional relational datasets. W...

متن کامل

On the Complexity of the $k$-Anonymization Problem

We study the problem of anonymizing tables containing personal information before releasing them for public use. One of the formulations considered in this context is the k-anonymization problem: given a table, suppress a minimum number of cells so that in the transformed table, each row is identical to atleast k − 1 other rows. The problem is known to be NP-hard and MAXSNP-hard; but in the kno...

متن کامل

Improved approximation bounds for the minimum rainbow subgraph problem

In this paper we consider the Minimum Rainbow Subgraph problem (MRS): Given a graph G with n vertices whose edges are coloured with p colours, find a subgraph F ⊆ G of minimum order and with p edges such that F contains each colour exactly once. We present a polynomial time ( 2 + ( 2 + )∆)-approximation algorithm for the MRS problem for an arbitrary small positive . This improves the previously...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/0707.0421  شماره 

صفحات  -

تاریخ انتشار 2007